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Redundancy is commonly used to guarantee continued functionality in networked systems. How¬ 
ever, often many nodes are vulnerable to the same failure or adversary. A “backup” path is not 
sufficient if both paths depend on nodes which share a vulnerability. For example, if two nodes of the 
Internet cannot be connected without using routers belonging to a given untrusted entity, then all of 
their communication-regardless of the specific paths utilized-will be intercepted by the controlling 
entity. In this and many other cases, the vulnerabilities affecting the network are disjoint: each node 
has exactly one vulnerability but the same vulnerability can affect many nodes. To discover optimal 
redundancy in this scenario, we describe each vulnerability as a color and develop a “color-avoiding 
percolation” which uncovers a hidden color-avoiding connectivity. We present algorithms for color¬ 
avoiding percolation of general networks and an analytic theory for random graphs with uniformly 
distributed colors including critical phenomena. We demonstrate our theory by uncovering the 
hidden color-avoiding connectivity of the Internet. We find that less well-connected countries are 
more likely able to communicate securely through optimally redundant paths than highly connected 
countries like the US. Our results reveal a new layer of hidden structure in complex systems and can 
enhance security and robustness through optimal redundancy in a wide range of systems including 
biological, economic and communications networks. 


I. INTRODUCTION 

Many real-world complex systems, which we model as 
networks, display disjoint vulnerability to failure or at¬ 
tack. These vulnerabilities make networks far less robust 
than they seem. It is generally assumed that redundant 
connections through multiple paths improves robustness 
mm but if a given vulnerability affects a large set of 
nodes, this may not be the case. For example, if one node 
can not communicate with another without routing the 
information through routers under a given entity’s con¬ 
trol, secure communication is compromised. Similarly, in 
an economic network, if a firm has redundant suppliers 
but each supply chain includes nodes belonging to a given 
company, then there is an absence of competition-even 
if in principle there are multiple competing companies 
working in that sector. Similar considerations hold for 
nodes in a spatial network that are located near one an¬ 
other because transportation and economic assets in the 
same city will be affected by the same weather events 
or disasters mg. Disjoint vulnerabilities also appear 
in biological networks. Depending on the type of nutri¬ 
ents available, different metabolic pathways are enabled 
mm- In this case, the metabolic network is disjointly 
vulnerable to the absence of a certain type of nutrient. 
Robust functionality can be guaranteed only if there are 
paths connecting source and target metabolite even when 
each distinct nutrient is removed. Gene regulatory net¬ 
works exhibit similar multipath responses to environmen¬ 
tal conditions mm- 

In all of these cases, connectivity alone gives a poor 
picture of the network’s robustness and security. How¬ 
ever, since susceptibility to one vulnerability often pre¬ 
cludes susceptibility to another vulnerability, we can par¬ 
tition the network into disjoint subsets by vulnerability. 
The disjoint nature of the vulnerabilities allows for robust 


connectivity to be established, provided the network re¬ 
mains connected when each subset is removed. Here we 
present a new framework for analyzing disjointly vulnera¬ 
ble complex networks and show the conditions for which- 
even if every node is vulnerable-robust connectivity can 
be maintained. 

We model disjoint vulnerability by assigning every 
node in the network exactly one color, representing ex¬ 
actly that vulnerability. The color may represent owner¬ 
ship, geographical location, reliance on a critical mate¬ 
rial or some other vulnerability. Similar to polychromatic 
percolation mM, we consider the components formed 
by nodes of different colors separately. We then develop 
a “color-avoiding percolation” theory which allows us to 
determine the connectivity of the network when each 
color (ie, the set of all nodes of a given color) is removed. 
The set of nodes that are mutually connectible under the 
removal of any color comprise the color-avoiding giant 
component. The existence of this component indicates 
whether or not the disjoint vulnerabilities can be avoided 
or not. 


II. COLOR AVOIDING PERCOLATION 

On a non-colored network, if node or link failures oc¬ 
cur with a given probability, percolation theory can be 
used to determine overall connectivity [ails]. Perco¬ 
lation on complex networks has a rich history EHIH]. 
It has been used to study the resilience of the internet 
[m |20] , its susceptibility to virus spreading [21] and even 
in probabilistic routing algorithms [22]. It has also been 
used to understand word-of-mouth processes in social 
networks [23] [24], and the robustness of many biologi¬ 
cal networks including neural networks [25], metabolic 
networks [26] and mitochondrial networks [27]. Here we 
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FIG. 1: Illustration of color-avoiding connectivity, (a) In this network the sender S and the receiver R are color-avoiding 
connected (CAC), as the green path avoids black and white nodes, and the purple path avoids blue nodes, (b) Finding £coior? 
the largest color-avoiding connected component, (bl) The largest components without white (b2) without blue 

(£2) cind (b3) without black nodes (£3) are highlighted in red, in each frame. (b4) Considering all of the nodes which are 
either in the largest components without each color or connected to them, we arrive at the largest CAC component, £coior, the 
red nodes. Every pair of nodes in £coior is CAC. Note that some nodes are not color-avoiding connected but are necessary to 
form the color avoiding components, (c) Estimation of the fraction of color-avoiding connected pairs Ppair for quenched graphs 
with different values AScoior- Red squares show Poisson graphs with N = 10^ nodes, average degrees k = 1.6; 1.7; 1.9; 4.0 and 
C = 3 colors, the green circle shows the AS network with colors representing the countries which the AS are assigned to [12 US]. 
The black line indicates the case where AScoior accounts for all of the color-avoiding connected nodes. Deviations are only visible 
for the smallest value shown, with iScoior = 570. Ppair was approximated with samples of up to 5 x 10^ pairs, error-bars are 
smaller than the symbols where not visible. 


develop a new framework based on percolation theory but 
not reducible to any previous percolation problems. In 
this framework, connectivity corresponds to the ability 
to avoid disjoint vulnerabilities via multiple paths. 

We begin with an undirected unweighted network G 
with N nodes and adjacency matrix Aij. Every vertex 
i is assigned a color q G {1,2,. where C denotes 
the total number of colors. Faced with the possible vul¬ 
nerability or insecurity of all nodes of a single color, we 
seek a set of paths between two nodes such that no color 
is required for all paths. In non-colored graphs, a single 
path provides connectivity and in k-coie percolation any 
k paths are sufficient [28l |29] . We now define a pair of 
nodes as “ color-avoiding connected” (CAC) if, for ev¬ 
ery color c, there exists a path connecting this pair and 
avoiding all nodes of color c. We assume that the source 
and target themselves are secure, and their colors are not 
included in the calculation of color-avoiding connectivity. 
The paths are not necessarily unique: often one path can 
avoid multiple colors (see Figurej^). However, if C paths 


cannot avoid all C colors, then the source and target re¬ 
quire one of the colors to be connected and adding more 
paths will not help. Since avoiding disjoint vulnerabili¬ 
ties through multiple paths is a feasible strategy only if a 
giant CAC component exists, we do not address optimal 
path problems but rather focus on the properties of CAC 
components. 

Formally, we define a “color-avoiding connected com¬ 
ponent” as a maximal set of nodes, where every node 
pair in the set is color-avoiding connected. Several ex¬ 
amples of CAC components are shown in Fig. Ef and 
Supp. Fig 1. Note that there are nodes which are not 
themselves part of the CAC component but are neces¬ 
sary for the color-avoiding connectivity of nodes which 
are in the component. This occurs, for example, when 
all of the neighbors which lead from a node to the CAC 
component are of the same color. In such a case, the 
node itself is not CAC to the system as a whole because 
it must pass through nodes of a certain color before it 
can reach elsewhere. However, in general, this node will 
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still be necessary to form paths which avoid other colors. 
The fact that non-CAC nodes may be needed to create 
overall system color-avoiding connectivity is one indica¬ 
tion that a new kind of percolation theory is needed to 
uncover this hidden structure. 

By studying the largest CAC component, we obtain a 
clear quantitative measure of the feasibility of multiple 
paths to avoid disjoint vulnerabilities and information on 
where those paths should be routed. Furthermore, this 
gives us a way to measure the effect of changes in network 
topology, link density and color distribution. 

To find the largest set of color-avoiding connected 
nodes in any network with any color distribution, we 
propose the following algorithm. First, for every color 
c, we delete all nodes with color c and find the largest 
component in the remaining graph, Cc- Next, we de¬ 
fine £coior as the set of nodes which, for every color c, 
are either (a) in Cc or (b) have at least one link to it. 
Condition (b) represents the assumption that the color 
of the source and target are not included in the calcu¬ 
lation. If we only used condition (a), the calculation of 
>Ccoior from {Cc} would be equivalent to the calculation 
of the mutual giant component in interdependent m or 
multiplex networks [siiia and the result would always 
be an empty set because every node has some color c' 
and is therefore not a member of C^'. In Figure [^, we 
illustrate this method and further technical details are 
discussed in Supp. Sec. l.A. 

It is possible that >Ccoior does not represent the overall 
color-avoiding connectivity of the system due to smaller 
components. However, if >Ccoior scales with system size 
and the smaller color-avoiding connected components do 
not, then in the limit of large systems the overall color¬ 
avoiding connectivity is determined by Ccoior just like the 
overall connectivity is determined by the size of the giant 
component in non-colored graphs. With 5'coior defined as 
the fraction of the total nodes which are in £coior and 
Ppair defined as the total fraction of color-avoiding con¬ 
nected pairs among all node pairs, we can test if £coior 
accounts for the bulk of color-avoiding connectivity. In 
Figure we see that color-avoiding connectivity is in¬ 
deed dominated by Ccoior for random and real-world net¬ 
works. When 5'coior is small, non-giant clusters and the 
trivial color-avoiding connectivity which accompanies in¬ 
dividual links leads to deviations between Ppair and 5'coior 
but these deviations rapidly disappear as the sytem size 
increases. This validates the treatment of Ccoior as a 
proxy for color-avoiding connectivity. We proceed to de¬ 
velop analytical results based on percolation theory for 
random networks. 


III. ANALYTIC THEORY FOR RANDOM 
NETWORKS 

For the analytical treatment we use the annealed ap¬ 
proximation of networks of size N described through the 
configuration model m, in which a degree distribution 


p{k) is a conserved quantity from which an ensemble of 
network realizations is drawn. For a more comprehensive 
treatment see the supplementary information. 

Every node i is assigned a color q G {1,2,...,C}. 
The analytic framework presented here assumes that the 
colors are distributed uniformly at random. Hence, the 
color sequence {q} has probability Yl^r^ with the color 
frequencies Tc- 

We calculate 5'coior in the limit of A ^ oo as the 
probability that a single node belongs to Ccoior- Because 
>Ccoior is a subset of the regular giant component by con¬ 
struction, we begin by obtaining the solution for stan¬ 
dard percolation on random graphs [niissiisi]. The size 
of the giant component in a non-colored random graph 
is S = 1 — go{u) where ^o(^) = is the gener¬ 

ating function of the probability distribution u is 
the probability that a node is not connected to the gi¬ 
ant component over one particular link and is computed 
as the solution of u = gi{u), where gi{z) = ^o(^)/^o(l) 
is the generating function of excess degree m- Second, 
we let K,c be the expected number of a randomly chosen 
node’s neighbors of color c which are connected to the gi¬ 
ant component of standard percolation. Considering Kc 
for all colors, we obtain the vector k = (/^i,..., kc) with 
k' = tvc being the total number of links to the normal 
giant component. Third, the conditional probability Pj^ 
that the links suffice to connect to Ccoior, given that they 
belong to distribution k and that they already belong to 
the normal giant component, is: 


c 


= n (i - , 

C=1 

(1) 

U — 1 ^ ~ 

(2) 


in which Uc denotes the conditional probability that a 
link fails to connect to Cc given that it does connect to 
the normal giant component via a node having a color 
d ^ c. We define Uc = f if u = 1. The probability Uc that 
a single link does not connect to a giant Cc is calculated 
with Uc = Tc -\- {f — rc)gi{ud) (site percolation with a 
surviving fraction of nodes of 1 — rc m)- Combining 
these terms, we obtain a formula for 5'coior- 

oo k k' 

^color ^ ^ Pk ^ ^ Bk,k' ^ ^ ^k' (3) 

k=0 k'=0 K,i,...,Kc=0 

where the binomial factor Bk,k' (Supp. Eq. S7) accounts 
for the probability that out of k links U links connect 
to the normal giant component. The multinomial factor 
(Supp. Eq. S8) gives the multinomial probability 
of having the color distribution R among the neighbors 
belonging to the normal giant component. 

To obtain a closed-form solution for 5'coior, we now 
assume that every color occurs with equal probability: 
Tc = 1/C. With Ui = Uc being identical for all colors we 
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FIG. 2: Size of the giant color avoiding component AScoior in random networks with uniformly distributed colors. 

Dependence of Scoior on average degree k (a) for Erdos-Renyinetworks and (b) scale-free networks with different numbers of 
colors. Error bars are shown but barely visible for networks of size N = 10®. The blue lines show the corresponding analytical 
results. Eor comparison, we include the giant component size of standard percolation S (black solid) and the limiting case of a 
system with an infinite number of colors, Sco\or,oo (black dashed). As mentioned in the text, Sco\or,oo is the same as the giant 
component in 2-core percolation, (c): Critical exponent and finite size scaling for Erdos-Renyinetworks with C = 3. Note that 
in the critical region the theory and simulations show a slope of almost exactly 3 as predicted by Eq. Einite size scaling is 
shown with the results of > 150 realizations per size plotted individually and averaged. 


have (Supp. Eq. S20): 

c 

5'color,C = 


j=0 

X go\u-\-{l-u) 




c 


(4) 


We now discuss the limiting cases C = 2 and C ^ oo. 
The result for two colors can be simplified to (Supp. Eq. 
S17) 


5'color,2 = 1 - 2 c/o(mi) + go{2ui - 1) 


(5) 


which directly depends on ui only. As the number of 
colors tends to infinity, standard percolation is not re¬ 
covered and 5'coior remains smaller than the relative size 
of the giant component S and in fact 5'coior,oo is iden¬ 
tical to the giant component in k-coie percolation with 
k = 2 [ISlIini. The reason that 5'coior,oo is equivalent to 
2-core percolation is that-even if every node is a differ¬ 
ent color-if a node were connected via only one link, it 
would not be able to avoid the color of its sole neighbor. 
We demonstrate this directly by deriving an asymptotic 
form for 5'coior as C ^ oc (Supp. Eq. S23): 


^color,oo — S (1 '^) 


dgo{z) 


dz 


(6) 


which is the same result as in 2-core percolation. In Fig. 

t we see that 5'coior,c comes close to S'coior,oo even for 
= 10, indicating that even moderate color diversity 
comes close to the infinite color case. 

We now discuss graphs with broad degree distribu¬ 
tions with pk ^ k~^ {k > 0) and generating functions 
9o{z) = lAa{z)/C{a) and gi{z) = lAa-i{z)/[zC,{a - 1)], 
with Lia{z) the polylogarithm function. In Figure]^ we 


see results for C = 2 and C = 10 depending on the aver¬ 
age degree k = C(a — 1)/C(a) [15]. The limiting cases are 
diverging k for a = 2 and k = 1 for a ^ oo. We see that 
^crit is not strongly affected by the number of colors but 
that the size of the giant CAC component is substantially 
smaller than in the case of Erdos-Renyinetworks (see Fig¬ 
ure [^-b). The critical connectivity can be calculated us¬ 
ing Cohen’s criterion for site percolation m- With the 
fraction 1 — rc of nodes surviving random removal, we 
obtain 1 — Tc = 1 — 1/C = k/{(k^) — k). Since (k^) = 
C((a —2)/C((a), we have C((a —2)/C((a —1) = 1 + C/(C —1). 
Accordingly k ~ 1.254 for two colors, and it converges to 
k ^ 1.195 for C ^ oo. 

We find that Erdos-Renyi networks are more color¬ 
avoiding connected than scale-free networks of equal av¬ 
erage degree, the opposite of the results for resilience to 
random failures [T^ |20l [35] . This follows from the dif¬ 
ference in the 2-core envelopes; compare Figs. and 

Eb. 


IV. CRITICAL PHENOMENA 

We now turn to the critical behavior of S'coior in Erdos- 
Renyi-graphs with C uniformly distributed colors. Sim¬ 
ilar to standard percolation, we find that the size of the 
largest color-avoiding connected component S'coior under¬ 
goes a phase transition at a specific ferit, which is now 
determined by the number of colors see Figure For 
k < ^crit, color-avoiding connectivity is confined to clus¬ 
ters of finite size (zero in the limit of large N) and for 
k > ^crit there is a largest color-avoiding connected com¬ 
ponent 5'coior which scales with system size. We find that 
the value of ^crit decreases as C increases and approaches 
the standard percolation threshold as C ^ oo. Since 
color-avoiding connectivity requires that the giant com- 
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FIG. 3: Color-avoiding connectivity of the AS-level internet, (a) Here we show the routers of the AS-level internet 
in the Iberian peninsula as a disjointly vulnerable network with the colors determined by the country to which the router is 
registered, (b) Using these colors, we calculate the largest color-avoiding connected component. The green nodes are members 
of this set while the red are not. This means that these routers can take advantage of multiple paths to maintain security, as 
desribed in the main text, (c) This shows the number of CAC routers (nodes in Lcoior) compared to total number of routers 
for the top 20 countries worldwide, in terms of total number of AS routers registered to that country. Data for the US has 
been trunctated for visibility, the total number of AS routers is 17690. We use a symmetrized version of the network of HU 
which was generated using data from the CAIDA project m up to December 2013. 


ponent not be destroyed after the removal of any single 
color, we require that k^ru ~ ^crit^^ where k^ru ~ ^ 
is the percolation threshold for ER graphs and is 
the fraction of links remaining after the removal of 1/C 
nodes. Therefore ^crit = C/{C — 1). 

To discuss the scaling and critical exponents, we return 
to the definition of P/^, Eq. We consider the region 
close but above ^crit by defining e = l — Ui^C{k — ^crit) 
which holds as long as (k — kcrit) 1/C (Supp. Eq. S27). 

We analyze the behavior of P/^ for small 6 by expanding 
(1 — {k' — Kc)e. Plugging this approximation 
in to Eqs. [^andj^we obtain: 

^color OC (^k ^crit)^ (7) 

^ = ferit=C/(C-l). (8) 

We confirm the value of ^crit and the scaling of Pcoior 
numerically in Eigure for (7 = 3 colors. As (7 ^ oc, 
we need to resolve the seeming contradiction of a diver¬ 
gent critical exponent (3 = C and convergence towards 
5'coior,cx3 as it appears in Eq. Eor ER networks we 
show (Supp. Eq. S31) that 5'coior,oo oc (k — 1)^ for k near 
1, implying /3 = 2. The reason that we do not observe 
/3 ^ oc as described in Eq. is that the approximation 
used to obtain Eq. [^is only valid in a critical region de¬ 
fined as (^ — ^crit) ^ 1/C. As(7^oc, Pcoior incrcascs 
with the high exponent (3 = C. However, the shrinking 
critical region overpowers the diverging critical exponent 
and Pcoior ^ 0 takes on unobservably small values and 
crosses over to /3 = 2 scaling outside the critical region. 


V. APPLICATIONS 

One immediate application of our framework is to se¬ 
cure communication in a network with no trusted nodes. 
Assuming C router owners, each of whom eavesdrops on 
its routers traffic, we can securely communicate if mes¬ 
sages are split with a secret sharing protocol j36H38] and 
transmitted along multiple color-avoiding paths. The 
nodes which can take advantage of this method are ex¬ 
actly the elements of the largest CAC component. 

To study the hidden CAC structure of the internet, we 
use a symmetrized version of the AS-level internet pre¬ 
pared by [12] which was generated using data from the 
CAIDA project m up to December 2013. We then color 
every router according to the country to which the router 
is registered, reflecting the assumption that every coun¬ 
try is eavesdropping on its traffic but that no countries 
share information (Eig. Hf). Using the algorithm for find¬ 
ing the largest CAC component, we can determine which 
nodes are color-avoiding connectable and which are not 
(Fig. It). 

We find that overall 26228 out of 49743 (?^ 52.73%) of 
the routers are in the largest CAC component and that 
this accounts for the vast majority of CAC connected 
nodes (Eig. [^). However, we also find that these results 
vary greatly from country to country. Eor instance, only 
25% of the routers registered to the United States are in 
the largest CAC component compared to 89% of routers 
registered to Russia (Eig. |^. This is partially due to 
the density of routers in the US which is much higher 
than Russia and indicates that US eavesdroppers have far 
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greater capacity to intercept communication than their 
Russian counterparts. 

In economic trade networks, it is common that a sin¬ 
gle firm controls many others [39] but each firm is con¬ 
trolled by only one owner. The vulnerability to correlated 
failures or malicious activities can undermine the overall 
system robustness, if they are sufficient to disrupt the 
global color-avoiding connectivity. We thus add color¬ 
avoiding connectivity to the concerns regarding systemic 
risk and government regulation of mergers and acquisi¬ 
tions [401 E]. 

In epidemiology, many diseases spread via different 
strains, and individuals may become immune after re¬ 
covery [42] . Coloring nodes by strain, color-avoiding per¬ 
colation can be used to evaluate the population’s suscep¬ 
tibility to a multi-strain infection. 

VI. DISCUSSION 

We have presented here the first systematic study of 
disjoint vulnerabilities in complex networks and a way to 
maintain network robustness by utilizing multiple paths. 
We have shown that even a small diversity of colors can 
enable color-avoiding connectivity to a large fraction of 
nodes in a random network but that in real-world net¬ 
works, uneven distribution of vulnerabilities can under¬ 


mine this effect. The framework and metrics uncover 
a hidden structure that underlies any complex network 
with nodes that can be partitioned by their susceptibil¬ 
ity to an external threat and can be used to devise new 
network design principles and protocols for improving ro¬ 
bustness through redundancy. 
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Appendix A: Supplementary Information 


List of variables 


Networks 

N Number of nodes 

k Average degree 

ki Degree of node i 

Pk Degree distribution 

a Exponent of scale free degree distribution 

go Generating function of degree 

gi Generating function of excess degree 

Golors 

C Number of colors 

c G 1, 2,... C A color 

Tc Golor distribution 

Standard percolation ingredients 

C Set of nodes in the largest component (color blind) 

u Prob. of not being connected to giant comp, over a link 

S Size of giant component 

Cc Set of nodes in the largest component, after nodes of color c deleted 

Uc Prob. of not being connected to giant Cc over a link 

Sc Size of giant Cc 

Percolation over color avoiding paths 

£coior Gandidate set of nodes for the largest avoidable colors component 

^coior Size of giant CcoXor 

Bk^k' Prob. that out of k links k' connect to giant component 

Mk'^i^ Prob. that out of k' links ki connect to color 1 etc. 

Pj^ Success probability having neighbors of colors ace. to ^ 

Uc Prob. that a link fails connecting to >Ccoior which already connects to C and a node not having color c 
Scolor,oo Size of the set of all nodes being connected to giant component over two links or more 

P Gritical exponent 

^crit Gritical value of average degree 




FIG. 4: Color avoiding components may overlap, as shown in (b) and (c). Color avoiding components can assume diverse 
forms. In a chain (a), paths between nodes of one color exist and can be reached by connections between nodes of different 
colors. In (b), the black node serves as an alternative path provider for the blue nodes. The graph (d) does not need any 
connection among nodes of the same color, but there is a massive overhead of nodes and connections to achieve color-avoiding 
connectivity of the blue nodes. A clique is a color avoiding component (e). 


Appendix B: Size of giant avoidable colors component in the configuration model 

We can find analytical results for 5'coior foi* random graph ensembles with randomly distributed colors in the limit 
of infinite graphs. These results can be used to estimate the situation in finite quenched networks. We are able to 
gain a general understanding including phase transitions. This knowledge can guide our understanding of real world 
networks. 

We use the generalized configuration model graph ensemble with N nodes, where each degree sequences {ki} occurs 
with probability YliPki^ with the degree distribution pk. Additionally we want to assign to every node i a color 
Ci G 1, 2,..., C. The color sequence {q} has probability Yl- Vc^ with the color distribution Tc- For a graph Gn out of 
the graph ensemble, >Ccoior has a certain size A'coior(GAr)- For the whole graph ensemble, we have to use the average 
value. By considering only giant contributions growing with network size, we have 

5coior = lim (SBl) 

N^OO iV 

Gn 

where P{Gn) = YliPki^ Yli is the probability to have the graph Gn of size A, including cj, the probability of the 
connection scheme of Gw as a matching of half edges. 


1. On the construction and maximality of £coior 

By construction, every node pair in Gcoior is color-avoiding connected. Furthermore, >Ccoior is maximal and therefore 
it is a color-avoiding connected component, if it includes for every color c at least one node out of £c- If for every 
color c, £coior includes at least one node out of jCc^ it is maximal and therefore it is an avoidable colors component. 
To prove this, assume it was not maximal. Then a node can be added which is (a) connected to every node in Gcoior 
and (b) excluded from >Ccoior; it does not connect to Cc' for some color c'. Consequently, it cannot connect to the 
nodes in >Ccoior which belong to jCc' ^ which contradicts (a). 


2. Question and connection to percolation theory 

For calculating 5'coior in the random graph ensemble, we will follow ideas of Erdos and Renyi [33] and Newman [34] . 
For calculating the size of the giant component, they used probabilities of connections for a single node in the graph 
ensemble. As we have to extend the method to a gradual procedure with conditional probabilities, it is useful to 
introduce the original method in detail with a shifted viewpoint. 

Lets denote with jC the set of all nodes belonging to the largest component. In figure on the outer left, a possible 
situation is illustrated. The largest component contains of a large part of the network, and the remaining nodes 
belong to smaller components. We have to calculate the size S of the giant component, meaning the average relative 
size of C in the network ensemble in the limit of infinite network size. For this we can define the average probability 
u that a node fails to connect to C over one particular link. This is illustrated in the figure with the left part. Again, 
the thermodynamic limit A ^ oo is implied. With the definition of u at hand, we can calculate S in two steps: First, 
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FIG. 5: We base our theory on the method to calculate the size of normal giant components, as illustrated in this figure. Using 
a self consistency equation, the probability u can be calculated. This is the probability, that a node is not connected to the 
giant component over a single link (see on the left). On the right, the probability for a node with k links is illustrated to have 
at least one link connecting to the giant component, is the probability that all links fail. 


using a self consistency equation, u is calculated. The probability u is identical to the probability that the neighbor 
does not connect to the giant component over any of the remaining links, 

u = gi{u), gi{z) = ^qkz^. (SB2) 

k 


In this equation, gi is the generating function of excess degree For important degree distributions 

as e.g. Poisson or scale-free, the equation for u can only be solved numerically. The second step is an averaging over 
nodes with various degrees k. The probability to connect to the giant component over any of k links is (1 — 
meaning that not all links fail at the same time. This is illustrated in the figure on the right. As a node which 
connects to the giant component belongs to it, 

CX3 

S = '^Pk{l-u'^) = 1 - go{u), go{z) = '^pkz^. (SB3) 

fc=0 k 



FIG. 6: We have to calculate the probability, if a node with k links is for every color c connected to the giant component Cc 
with deleted color c. All connections over at least one link have to exist at the same time. We illustrate this question with the 
three colors red (c =r), green (c =g) and blue (c =b). If a link connects to £g, it for sure does not connect to Cc for one of the 
other colors. This kind of dependence forces us to use a stepwise calculation with conditional probabilities. 

In analogy to the procedure described above, we will calculate S'coior as the probability that a randomly chosen node 
belongs to £coior- This has to be evaluated in the graph ensemble of infinite size. As we will perform an averaging 
over nodes with various degrees /c, the following question has to be answered: What is the probability that a node 
with k links connects to a giant Cc for all colors c at the same time. This is illustrated in figure On the left, the 
situation for a graph with colors on the nodes is illustrated. Nodes of all colors might be in the largest component. 
After deleting all nodes of one color c, the remaining largest component Cc might still contain a large part of all nodes 
in C. The condition for the node belonging to £coior is illustrated on the right of the figure. 

We will use the same two steps to attack this problem, as described for calculating the giant component above. 
First, we provide some single link probabilities which can be used as primitives for the further calculations. Second, 
we combine the single link probabilities to calculate S'coior- 














10 


3. Single link probabilities 



FIG. 7: Probabilities for a single link to connect to different parts of the network. We use these probabilities as primitives 
to calculate the probability for many links. While Vc and Uc can be calculated with standard methods invented for the 
configuration model before, the conditional probability Uc can be calculated as a combination of the others. 


We already gave equation |SB2| for calculating the probability u. In the case of colors on the nodes, as illustrated in 
figure on the left, the colors can simply be ignored. We further need the probability to connect to a node of color 
c which is simply Tc- This is illustrated in the second column of the figure. We further introduce Uc^ the probability 
that a single link does not connect to a giant £c- See the third column of the figure for an illustration. This can be 
calculated using percolation theory for random attack by solving 


Uc = rc + {I - rc)gi{uc). 


(SB4) 


Unfortunately, Uc cannot be used directly for calculating S'coior- If we look at the same link, the probabilities Uc 
are dependent for different colors. The most obvious argument is that always nc(l — Uc) = 0, as a link must at least 
miss one of the £c- Instead, we will use the conditional probability Uc, as illustrated with the outer right column of 
the figure. The precondition is that a link connects to the giant component and the node it connects to has not color 
c. Uc is the probability that such a link connects to £c- For calculating it, we use the primitives introduced so far, 
as illustrated in figure]^ Assuming independence of the probabilities (1 — u) for connecting to the giant component 
and (1 — Tc) for not connecting to a node of color c, the precondition of Uc can be constructed. In this way, we can 
construct (1 — Uc) using the probability we are searching for: (1 — Uc) = (1 — u){^ ~ u)(l ~ lA)- With this we find 


Uc- = 1 - 


I - Uc 

(l-u)(l-rc)' 


(SB5) 


If (1 — rc) = 0, the precondition holds for an empty set of nodes. In this case we define Uc = 1. Notice that the 

additional information of the explicit color, instead of only stating that the color is not c, does not alter the results, as 
a further restriction of the colors would meat the numerator and denominator identically and therefore would cancel 
out. 
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; 1-C/g 



FIG. 8: This figure illustrates the calculation of t/g using the equality (1— u)(l — rg)(l — Fg) = 1 — Ug. For that, we have 
assumed independence of the qualities of the link under consideration, especially of the color it connects to and if it connects 
to the giant component. 


4. Averaging over link distributions 


As in equation SB3| fQr the giant component, we want to get an analytical result for 5'coior by averaging over possible 
link constellations of a randomly chosen node. Let us give the whole result and then explain it step by step afterwards: 


oo k k' 

Scolor — ^ ^^ Pk ^ ^ ^ ^ ^k' 

k=0 k'=0 Ki,...,Kc=0 

Bk,k' = 


k'k-k' 


= 


k'\ 


Ki\ X • • • X Kcl 
C 




(SB6) 

(SB7) 

(SB8) 

(SB9) 


C=1 


The formulas include the single link probabilities rc, u (equation ( SB2[ )) and Uq (equation ( |SB5 ) with (SB4)). An 
illustration of the procedure can be seen in figure is the binomial probability that out of the k links k' 

links connect to the giant component. M ]^>gives the multinomial probability for a certain color distribution among 
the k' links connecting to the giant component. We assume that this second step is independent of the first step, 
what is confirmed with the final results. The numbers Kc count the links which connect to a node of color c in the 
giant component. Finally, Pj^ gives the joint probability that for the color distribution given by k all giant Cc are 
connected to at the same time. There is at least one link connecting to Cc with probability 1 — (Uc)^ . The success 

probabilities for different colors have to be multiplied, as all Cc have to be reached at the same time. We tested 
numerically that e.g. U\ and are independent for a link connecting to the giant component and a third color. 


Appendix C: Examination of iScoior 
1. Closed form solutions 

We now calculate closed form solutions for 5'coior for special cases. This is done to demonstrate how the extensive 
summations over k\ k and k can be performed analytically. In cases where this is not possible, a sampling of values 
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FIG. 9: For calculating the probability of a node with k links to belong to £coior, we have to average over different link 
constellations which this node might show. First, y is the probability that out of the k links k' connect to the giant 
component. It is calculated using u (compare figure on the left). Second, gives the probability for a certain color 

distribution among the links. It is calculated using Vg etc. (compare hgure|^ second from left). We assume that this second 
step is independent of the first step, what is conhrmed with the final results. Third, Pj^ gives the joint probability that for this 
color distribution £f, connected to at the same time. This is calculated using Uf etc. (compare hgurej^on the 

right). 


K has to be performed. The results can be tested against the analytically tractable situations and by comparing with 
numerical results. The closed form solutions presented here were used to calculate analytical results for the main 
article as well. 


For evaluating equation |SB6| with two colors, we first rewrite 


o'y 


k' 

1^1 J « 2=0 


/ti=0 



{ny{r2f~^^ 


-{riy{r2Uif-^^ + {riU-^y{r2Uif 


1 - (ri + r^Uif - (r 2 + + (nCg + r^Uif. 


(SCI) 

(SC2) 

(SC3) 

(SC4) 


In the last step, the binomial formula was used backward. We can use this procedure once more, and with equation 


SB5 
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and ri + r 2 = 1 we find 


^color — ^ ^ ^ -^/c, 


k'O'k' 


k'=0 

k 


= ^pk E 


k'=0 


k' 


k-k' 


{l-uf -{{l-u){n+r 2 Ui)f - ... 


k' 


J2Pk [1-{UJ)^-{U2)’^ + {UI+U2-If] 

k 

1 - 5o(wi) - 50 (^ 2 ) + 5o(mi +U 2 - 1). 


(SC5) 

(SC6) 

(SC7) 

(SC8) 


This result holds for any degree distribution and color distribution. Notice that Tc < < 1- The result for two colors 

does only depend on the probabilities Uc^ while conditional probabilities as Uc were eliminated. This was possible as 
Cl and £2 not overlapping for two colors. For Poisson graphs we find with the according generating function 


go{z)=g,{z) = e^^^-^\ 

S'color = [1 - 5'o(wi)][l -5 o(w2)]- 


(SC9) 

(SCIO) 


For more than two colors, Cc do overlap. For homogeneous color distributions Vc = l/C", a closed form solution can 
be found in the same way as for two colors with the binomial formula. We find 


^color 


E(-i)^ 

i=o 


C 


9o 




(SCll) 


Let us finally discuss the behavior for (7 ^ oc. This can be done utilizing the term the pro babilit y that a node 
connecting over k' links to the giant component belongs to 5'coior- As can be seen with equation SB9[ ao = cri =0. 
On the other hand, with Tc ^ 0, equati on |SB4| converges to equation |SB2| and therefore Ui 0. This means that 
^ 1- We finally find with equation |SB6 


^color.oo — ^color 

C^oo 


= 1 - '^Pk[u’" + k{l - uY 




/c=0 


= 1 - go{u) - (1 - m ) 


dgoiz) 


dz 


(SC12) 

(SC13) 

(SC14) 


2. Critical behavior for Poisson graphs 

With equation SB6[ vanishing dk' causes S'coior = 0. According to 


k' 


C 


ak>= E 


K,i,...,K,C=0 


C=1 


(SC15) 


this is th e cas e if Uc = I for any color c. With equation |SB5| we find that Uc = f whenever = 1. Examining 
equation SB4| for Uc^ we can relate to site percolation (random removal of nodes). For Poisson graphs we have 
^crit = (^ — f)/k. With homogeneous color distribution Tc = 1/C, we can resolve the critical connectivity given the 
number of colors 


^crit=C/(C-l). 


(SC16) 


The normal giant component size S shows a special critical behavior shortly above the transition point, it scales 
linearly with ^ — 1. Here we are interested in the b ehav ior of S'coior which is a function oil — ui which itself can be 
related to 1 — = S'. With inserting into equation SB4 it can be shown that Ucik) = rc + (1 — rc)u{{l — rc)k). For 

small arguments (k — ^crit). 


l-ui{k> /Ccrit) - (1 - ri) 


2 d(l -u) 


dk 


{k ^crit)‘ 


(SC17) 


/c=l+0 
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Inserting into equation 


SB5 


we find using 1 — u{k > 1) 


d(l—u) 
dk 


/c=l+0 

e = 1 - « C(k - k„it) 


(fc- 1) 


(SC18) 


if additionally k — ^crit ^ ^ — 1 holds {1 — ui small compared to 1 — u). 

For calculating we first need to evaluate Pj^ including expressions 1 — (Ui)^ . Replacing with e and applying 

an approximation we find 1 — = 1 — (1 — 5 )^ ^ {k' — Kc)e. This is true at least as long as k'e 1. With 

this we find ex (k — ^crit)^ independent of and finally 

(Tfe/ (X (k- k„it)^. (SC19) 


We finally find 


'^color OC (A) kcYit)^f 

/3 = C. 


(SC20) 

(SC21) 


The critical behavior of S'coior,oo for Poisson graphs can be evaluated with the generating function eq. dS^ and 
S = 1 — u. We find S'coior.oo = S — kS{l — S), and for small positive A: — 1 the giant component grows approximately 
with S ~ 2(k — l)/fc^. Therefore 

Cooler,CO ^{k- 1)2(4/A2 - 2/p) x{k- 1)^ (SC22) 
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